Overview

Dataset statistics

Number of variables: 35
Number of observations: 275
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 75.3 KiB
Average record size in memory: 280.5 B

Variable types

Numeric: 7
Categorical: 28

Warnings

ConsumoAlcoolDiaUtil_Pt is highly correlated with ConsumoAlcoolFimSemana_Pt (High correlation)
ConsumoAlcoolFimSemana_Pt is highly correlated with ConsumoAlcoolDiaUtil_Pt (High correlation)
NotaP1_Pt is highly correlated with NotaP2_Pt and 2 other fields (High correlation)
NotaP2_Pt is highly correlated with NotaP1_Pt and 2 other fields (High correlation)
NotaP3_Pt is highly correlated with NotaP1_Pt and 2 other fields (High correlation)
MediaNotas_Pt is highly correlated with NotaP1_Pt and 2 other fields (High correlation)
NotaP2_Pt is highly correlated with MediaNotas_Pt and 2 other fields (High correlation)
OcupacaoMae is highly correlated with OcupacaoPai and 2 other fields (High correlation)
ConsumoAlcoolDiaUtil_Pt is highly correlated with ConsumoAlcoolFimSemana_Pt (High correlation)
OcupacaoPai is highly correlated with OcupacaoMae and 1 other field (High correlation)
df_index is highly correlated with Escola (High correlation)
Faltas_Pt is highly correlated with Idade (High correlation)
Escola is highly correlated with df_index (High correlation)
ConsumoAlcoolFimSemana_Pt is highly correlated with ConsumoAlcoolDiaUtil_Pt (High correlation)
MediaNotas_Pt is highly correlated with NotaP2_Pt and 2 other fields (High correlation)
InstrucaoMae is highly correlated with OcupacaoMae and 1 other field (High correlation)
NotaP3_Pt is highly correlated with NotaP2_Pt and 3 other fields (High correlation)
SaiComAmigos_Pt is highly correlated with TempoLivre_Pt (High correlation)
Reprovacoes_Pt is highly correlated with NotaP3_Pt (High correlation)
NotaP1_Pt is highly correlated with NotaP2_Pt and 2 other fields (High correlation)
InstrucaoPai is highly correlated with OcupacaoMae and 2 other fields (High correlation)
Idade is highly correlated with Faltas_Pt (High correlation)
TempoLivre_Pt is highly correlated with SaiComAmigos_Pt (High correlation)
df_index is uniformly distributed (Uniform)
df_index has unique values (Unique)
Faltas_Pt has 97 (35.3%) zeros (Zeros)
NotaP2_Pt has 7 (2.5%) zeros (Zeros)
NotaP3_Pt has 10 (3.6%) zeros (Zeros)
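
The zero percentages in these warnings are the zero counts divided by the 275 observations. A minimal sketch that rechecks the arithmetic; the dictionary and the `zero_share` helper are illustrative names, not part of the report:

```python
# Observation count and zero counts as reported in the overview and warnings.
N_OBSERVATIONS = 275
zero_counts = {"Faltas_Pt": 97, "NotaP2_Pt": 7, "NotaP3_Pt": 10}


def zero_share(count: int, total: int) -> float:
    """Return the percentage of zeros, rounded to one decimal place."""
    return round(100 * count / total, 1)


zero_percentages = {
    name: zero_share(count, N_OBSERVATIONS) for name, count in zero_counts.items()
}

for name, pct in zero_percentages.items():
    print(f"{name}: {pct}% zeros")
```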

Reproduction

Analysis started: 2021-05-26 21:56:23.524210
Analysis finished: 2021-05-26 21:56:34.732778
Duration: 11.21 seconds
Software version: pandas-profiling v3.0.0
Download configuration: config.json

Variables

df_index
Real number (ℝ≥0)

HIGH CORRELATION
UNIFORM
UNIQUE

Distinct: 275
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 544
Minimum: 407
Maximum: 681
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.3 KiB

Quantile statistics

Minimum: 407
5-th percentile: 420.7
Q1: 475.5
Median: 544
Q3: 612.5
95-th percentile: 667.3
Maximum: 681
Range: 274
Interquartile range (IQR): 137
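
These quantiles can be reproduced by linear interpolation between neighbouring sorted values. The sketch below assumes that df_index holds every integer from 407 to 681, which follows from 275 distinct, strictly increasing values with a range of 274 only if the values are integers. It uses the standard library's `statistics.quantiles` with `method="inclusive"`, not the profiling tool's own code:

```python
import statistics

# Assumption: df_index is the run of consecutive integers 407..681 (275 values).
df_index = list(range(407, 682))

# method="inclusive" treats the data as containing its own min and max and
# interpolates linearly between neighbouring sorted values.
q1, median, q3 = statistics.quantiles(df_index, n=4, method="inclusive")
ventiles = statistics.quantiles(df_index, n=20, method="inclusive")
p5, p95 = ventiles[0], ventiles[-1]

value_range = max(df_index) - min(df_index)
iqr = q3 - q1

print(p5, q1, median, q3, p95, value_range, iqr)
```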

Descriptive statistics

Standard deviation: 79.5298686
Coefficient of variation (CV): 0.1461946114
Kurtosis: -1.2
Mean: 544
Median Absolute Deviation (MAD): 69
Skewness: 0
Sum: 149600
Variance: 6325
Monotonicity: Strictly increasing
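
The descriptive statistics above are consistent with one another, which a short standard-library sketch can confirm. It assumes df_index is the run of integers 407 to 681 and that the reported variance uses the sample (n - 1) estimator, which is what the reported value of 6325 matches:

```python
import math
import statistics

# Assumption: df_index is the run of consecutive integers 407..681 (275 values).
df_index = list(range(407, 682))

mean = statistics.mean(df_index)
variance = statistics.variance(df_index)  # sample variance, n - 1 denominator
std = math.sqrt(variance)
cv = std / mean  # coefficient of variation = standard deviation / mean

# Median absolute deviation: median of the absolute distances to the median.
med = statistics.median(df_index)
mad = statistics.median(abs(x - med) for x in df_index)

total = sum(df_index)
strictly_increasing = all(a < b for a, b in zip(df_index, df_index[1:]))

print(mean, variance, std, cv, mad, total, strictly_increasing)
```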
Histogram with fixed size bins (bins=50)
Value               Count  Frequency (%)
512                 1      0.4%
418                 1      0.4%
424                 1      0.4%
423                 1      0.4%
422                 1      0.4%
421                 1      0.4%
420                 1      0.4%
419                 1      0.4%
417                 1      0.4%
443                 1      0.4%
Other values (265)  265    96.4%

Value  Count  Frequency (%)
407    1      0.4%
408    1      0.4%
409    1      0.4%
410    1      0.4%
411    1      0.4%
412    1      0.4%
413    1      0.4%
414    1      0.4%
415    1      0.4%
416    1      0.4%

Value  Count  Frequency (%)
681    1      0.4%
680    1      0.4%
679    1      0.4%
678    1      0.4%
677    1      0.4%
676    1      0.4%
675    1      0.4%
674    1      0.4%
673    1      0.4%
672    1      0.4%

Escola
Categorical

HIGH CORRELATION

Distinct: 2
Distinct (%): 0.7%
Missing: 0
Missing (%): 0.0%
Memory size: 2.3 KiB
Mousinho da Silveira: 186
Gabriel Pereira: 89

Length

Max length: 20
Median length: 20
Mean length: 18.38181818
Min length: 15

Characters and Unicode

Total characters: 5055
Distinct characters: 18
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
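
As an illustration of how such character properties can be tallied, the sketch below recomputes the category totals for Escola from its two values and their counts. It uses the standard library's `unicodedata.category`, which returns General Category codes such as `Ll`, `Lu` and `Zs`; this is a stand-in for whatever the profiling tool does internally, and the variable names are illustrative:

```python
import unicodedata
from collections import Counter

# Value counts for Escola as reported in this section.
escola_counts = {"Mousinho da Silveira": 186, "Gabriel Pereira": 89}

category_totals = Counter()
for value, count in escola_counts.items():
    for ch in value:
        # Two-letter General Category code: "Ll" = Lowercase Letter,
        # "Lu" = Uppercase Letter, "Zs" = Space Separator.
        category_totals[unicodedata.category(ch)] += count

total_characters = sum(category_totals.values())
mean_length = total_characters / sum(escola_counts.values())

print(dict(category_totals), total_characters, mean_length)
```

The totals agree with the figures reported for this variable: 4044 lowercase letters, 550 uppercase letters and 461 spaces, for 5055 characters in all.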

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Gabriel Pereira
2nd row: Gabriel Pereira
3rd row: Gabriel Pereira
4th row: Gabriel Pereira
5th row: Gabriel Pereira

Common Values

Value                 Count  Frequency (%)
Mousinho da Silveira  186    67.6%
Gabriel Pereira       89     32.4%

Length

Histogram of lengths of the category

Pie chart

Value     Count  Frequency (%)
da        186    25.3%
mousinho  186    25.3%
silveira  186    25.3%
pereira   89     12.1%
gabriel   89     12.1%

Most occurring characters

ValueCountFrequency (%)
i736
14.6%
a550
10.9%
461
 
9.1%
r453
 
9.0%
e453
 
9.0%
o372
 
7.4%
l275
 
5.4%
M186
 
3.7%
u186
 
3.7%
s186
 
3.7%
Other values (8)1197
23.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter4044
80.0%
Uppercase Letter550
 
10.9%
Space Separator461
 
9.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i736
18.2%
a550
13.6%
r453
11.2%
e453
11.2%
o372
9.2%
l275
 
6.8%
u186
 
4.6%
s186
 
4.6%
n186
 
4.6%
h186
 
4.6%
Other values (3)461
11.4%
Uppercase Letter
ValueCountFrequency (%)
M186
33.8%
S186
33.8%
G89
16.2%
P89
16.2%
Space Separator
ValueCountFrequency (%)
461
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin4594
90.9%
Common461
 
9.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
i736
16.0%
a550
12.0%
r453
9.9%
e453
9.9%
o372
 
8.1%
l275
 
6.0%
M186
 
4.0%
u186
 
4.0%
s186
 
4.0%
n186
 
4.0%
Other values (7)1011
22.0%
Common
ValueCountFrequency (%)
461
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII5055
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
i736
14.6%
a550
10.9%
461
 
9.1%
r453
 
9.0%
e453
 
9.0%
o372
 
7.4%
l275
 
5.4%
M186
 
3.7%
u186
 
3.7%
s186
 
3.7%
Other values (8)1197
23.7%

Genero
Categorical

Distinct2
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
Feminino
185 
Masculino
90 

Length

Max length9
Median length8
Mean length8.327272727
Min length8

Characters and Unicode

Total characters2290
Distinct characters12
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowMasculino
2nd rowFeminino
3rd rowFeminino
4th rowMasculino
5th rowFeminino

Common Values

ValueCountFrequency (%)
Feminino185
67.3%
Masculino90
32.7%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
feminino185
67.3%
masculino90
32.7%

Most occurring characters

ValueCountFrequency (%)
i460
20.1%
n460
20.1%
o275
12.0%
F185
8.1%
e185
8.1%
m185
8.1%
M90
 
3.9%
a90
 
3.9%
s90
 
3.9%
c90
 
3.9%
Other values (2)180
 
7.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter2015
88.0%
Uppercase Letter275
 
12.0%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i460
22.8%
n460
22.8%
o275
13.6%
e185
9.2%
m185
9.2%
a90
 
4.5%
s90
 
4.5%
c90
 
4.5%
u90
 
4.5%
l90
 
4.5%
Uppercase Letter
ValueCountFrequency (%)
F185
67.3%
M90
32.7%

Most occurring scripts

ValueCountFrequency (%)
Latin2290
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
i460
20.1%
n460
20.1%
o275
12.0%
F185
8.1%
e185
8.1%
m185
8.1%
M90
 
3.9%
a90
 
3.9%
s90
 
3.9%
c90
 
3.9%
Other values (2)180
 
7.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII2290
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
i460
20.1%
n460
20.1%
o275
12.0%
F185
8.1%
e185
8.1%
m185
8.1%
M90
 
3.9%
a90
 
3.9%
s90
 
3.9%
c90
 
3.9%
Other values (2)180
 
7.9%

Idade
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 7
Distinct (%): 2.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 16.96363636
Minimum: 15
Maximum: 21
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.3 KiB

Quantile statistics

Minimum: 15
5-th percentile: 15
Q1: 16
Median: 17
Q3: 18
95-th percentile: 19
Maximum: 21
Range: 6
Interquartile range (IQR): 2

Descriptive statistics

Standard deviation: 1.237545787
Coefficient of variation (CV): 0.07295285991
Kurtosis: 0.00786585643
Mean: 16.96363636
Median Absolute Deviation (MAD): 1
Skewness: 0.4185722799
Sum: 4665
Variance: 1.531519575
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=7)
Value  Count  Frequency (%)
17     81     29.5%
16     74     26.9%
18     61     22.2%
15     31     11.3%
19     21     7.6%
20     5      1.8%
21     2      0.7%

Value  Count  Frequency (%)
15     31     11.3%
16     74     26.9%
17     81     29.5%
18     61     22.2%
19     21     7.6%
20     5      1.8%
21     2      0.7%

Value  Count  Frequency (%)
21     2      0.7%
20     5      1.8%
19     21     7.6%
18     61     22.2%
17     81     29.5%
16     74     26.9%
15     31     11.3%

Endereco
Categorical

Distinct2
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
Urbano
159 
Rural
116 

Length

Max length6
Median length6
Mean length5.578181818
Min length5

Characters and Unicode

Total characters1534
Distinct characters9
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowUrbano
2nd rowUrbano
3rd rowUrbano
4th rowUrbano
5th rowUrbano

Common Values

ValueCountFrequency (%)
Urbano159
57.8%
Rural116
42.2%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
urbano159
57.8%
rural116
42.2%

Most occurring characters

ValueCountFrequency (%)
r275
17.9%
a275
17.9%
U159
10.4%
b159
10.4%
n159
10.4%
o159
10.4%
R116
7.6%
u116
7.6%
l116
7.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter1259
82.1%
Uppercase Letter275
 
17.9%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
r275
21.8%
a275
21.8%
b159
12.6%
n159
12.6%
o159
12.6%
u116
9.2%
l116
9.2%
Uppercase Letter
ValueCountFrequency (%)
U159
57.8%
R116
42.2%

Most occurring scripts

ValueCountFrequency (%)
Latin1534
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
r275
17.9%
a275
17.9%
U159
10.4%
b159
10.4%
n159
10.4%
o159
10.4%
R116
7.6%
u116
7.6%
l116
7.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII1534
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
r275
17.9%
a275
17.9%
U159
10.4%
b159
10.4%
n159
10.4%
o159
10.4%
R116
7.6%
u116
7.6%
l116
7.6%

TamFamilia
Categorical

Distinct2
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
GT3
187 
LE3
88 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters825
Distinct characters5
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowGT3
2nd rowGT3
3rd rowLE3
4th rowGT3
5th rowGT3

Common Values

ValueCountFrequency (%)
GT3187
68.0%
LE388
32.0%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
gt3187
68.0%
le388
32.0%

Most occurring characters

ValueCountFrequency (%)
3275
33.3%
G187
22.7%
T187
22.7%
L88
 
10.7%
E88
 
10.7%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter550
66.7%
Decimal Number275
33.3%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
G187
34.0%
T187
34.0%
L88
16.0%
E88
16.0%
Decimal Number
ValueCountFrequency (%)
3275
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin550
66.7%
Common275
33.3%

Most frequent character per script

Latin
ValueCountFrequency (%)
G187
34.0%
T187
34.0%
L88
16.0%
E88
16.0%
Common
ValueCountFrequency (%)
3275
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII825
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
3275
33.3%
G187
22.7%
T187
22.7%
L88
 
10.7%
E88
 
10.7%

StatusPais
Categorical

Distinct2
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
Juntos
233 
Separados
42 

Length

Max length9
Median length6
Mean length6.458181818
Min length6

Characters and Unicode

Total characters1776
Distinct characters12
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowJuntos
2nd rowJuntos
3rd rowSeparados
4th rowJuntos
5th rowSeparados

Common Values

ValueCountFrequency (%)
Juntos233
84.7%
Separados42
 
15.3%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
juntos233
84.7%
separados42
 
15.3%

Most occurring characters

ValueCountFrequency (%)
o275
15.5%
s275
15.5%
J233
13.1%
u233
13.1%
n233
13.1%
t233
13.1%
a84
 
4.7%
S42
 
2.4%
e42
 
2.4%
p42
 
2.4%
Other values (2)84
 
4.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter1501
84.5%
Uppercase Letter275
 
15.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
o275
18.3%
s275
18.3%
u233
15.5%
n233
15.5%
t233
15.5%
a84
 
5.6%
e42
 
2.8%
p42
 
2.8%
r42
 
2.8%
d42
 
2.8%
Uppercase Letter
ValueCountFrequency (%)
J233
84.7%
S42
 
15.3%

Most occurring scripts

ValueCountFrequency (%)
Latin1776
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
o275
15.5%
s275
15.5%
J233
13.1%
u233
13.1%
n233
13.1%
t233
13.1%
a84
 
4.7%
S42
 
2.4%
e42
 
2.4%
p42
 
2.4%
Other values (2)84
 
4.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII1776
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
o275
15.5%
s275
15.5%
J233
13.1%
u233
13.1%
n233
13.1%
t233
13.1%
a84
 
4.7%
S42
 
2.4%
e42
 
2.4%
p42
 
2.4%
Other values (2)84
 
4.7%

InstrucaoMae
Categorical

HIGH CORRELATION

Distinct5
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
Fundamental1
92 
Fundamental2
90 
EMedio
46 
Superior
44 
Nenhuma
 
3

Length

Max length12
Median length12
Mean length10.30181818
Min length6

Characters and Unicode

Total characters2833
Distinct characters20
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowFundamental2
2nd rowFundamental2
3rd rowFundamental2
4th rowFundamental2
5th rowFundamental2

Common Values

ValueCountFrequency (%)
Fundamental192
33.5%
Fundamental290
32.7%
EMedio46
16.7%
Superior44
16.0%
Nenhuma3
 
1.1%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
fundamental192
33.5%
fundamental290
32.7%
emedio46
16.7%
superior44
16.0%
nenhuma3
 
1.1%

Most occurring characters

ValueCountFrequency (%)
n367
13.0%
a367
13.0%
e275
9.7%
u229
8.1%
d228
8.0%
m185
 
6.5%
F182
 
6.4%
t182
 
6.4%
l182
 
6.4%
192
 
3.2%
Other values (10)544
19.2%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter2330
82.2%
Uppercase Letter321
 
11.3%
Decimal Number182
 
6.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n367
15.8%
a367
15.8%
e275
11.8%
u229
9.8%
d228
9.8%
m185
7.9%
t182
7.8%
l182
7.8%
i90
 
3.9%
o90
 
3.9%
Other values (3)135
 
5.8%
Uppercase Letter
ValueCountFrequency (%)
F182
56.7%
E46
 
14.3%
M46
 
14.3%
S44
 
13.7%
N3
 
0.9%
Decimal Number
ValueCountFrequency (%)
192
50.5%
290
49.5%

Most occurring scripts

ValueCountFrequency (%)
Latin2651
93.6%
Common182
 
6.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
n367
13.8%
a367
13.8%
e275
10.4%
u229
8.6%
d228
8.6%
m185
7.0%
F182
6.9%
t182
6.9%
l182
6.9%
i90
 
3.4%
Other values (8)364
13.7%
Common
ValueCountFrequency (%)
192
50.5%
290
49.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII2833
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
n367
13.0%
a367
13.0%
e275
9.7%
u229
8.1%
d228
8.0%
m185
 
6.5%
F182
 
6.4%
t182
 
6.4%
l182
 
6.4%
192
 
3.2%
Other values (10)544
19.2%

InstrucaoPai
Categorical

HIGH CORRELATION

Distinct5
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
Fundamental2
104 
Fundamental1
99 
EMedio
34 
Superior
33 
Nenhuma
 
5

Length

Max length12
Median length12
Mean length10.68727273
Min length6

Characters and Unicode

Total characters2939
Distinct characters20
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowEMedio
2nd rowFundamental1
3rd rowFundamental1
4th rowFundamental1
5th rowFundamental2

Common Values

ValueCountFrequency (%)
Fundamental2104
37.8%
Fundamental199
36.0%
EMedio34
 
12.4%
Superior33
 
12.0%
Nenhuma5
 
1.8%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
fundamental2104
37.8%
fundamental199
36.0%
emedio34
 
12.4%
superior33
 
12.0%
nenhuma5
 
1.8%

Most occurring characters

ValueCountFrequency (%)
n411
14.0%
a411
14.0%
e275
9.4%
u241
8.2%
d237
8.1%
m208
7.1%
F203
6.9%
t203
6.9%
l203
6.9%
2104
 
3.5%
Other values (10)443
15.1%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter2427
82.6%
Uppercase Letter309
 
10.5%
Decimal Number203
 
6.9%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
n411
16.9%
a411
16.9%
e275
11.3%
u241
9.9%
d237
9.8%
m208
8.6%
t203
8.4%
l203
8.4%
i67
 
2.8%
o67
 
2.8%
Other values (3)104
 
4.3%
Uppercase Letter
ValueCountFrequency (%)
F203
65.7%
E34
 
11.0%
M34
 
11.0%
S33
 
10.7%
N5
 
1.6%
Decimal Number
ValueCountFrequency (%)
2104
51.2%
199
48.8%

Most occurring scripts

ValueCountFrequency (%)
Latin2736
93.1%
Common203
 
6.9%

Most frequent character per script

Latin
ValueCountFrequency (%)
n411
15.0%
a411
15.0%
e275
10.1%
u241
8.8%
d237
8.7%
m208
7.6%
F203
7.4%
t203
7.4%
l203
7.4%
i67
 
2.4%
Other values (8)277
10.1%
Common
ValueCountFrequency (%)
2104
51.2%
199
48.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII2939
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
n411
14.0%
a411
14.0%
e275
9.4%
u241
8.2%
d237
8.1%
m208
7.1%
F203
6.9%
t203
6.9%
l203
6.9%
2104
 
3.5%
Other values (10)443
15.1%

OcupacaoMae
Categorical

HIGH CORRELATION

Distinct5
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
Outras
122 
Do Lar
82 
Servidor
42 
Saude
15 
Professor
14 

Length

Max length9
Median length6
Mean length6.403636364
Min length5

Characters and Unicode

Total characters1761
Distinct characters17
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowOutras
2nd rowServidor
3rd rowOutras
4th rowDo Lar
5th rowOutras

Common Values

ValueCountFrequency (%)
Outras122
44.4%
Do Lar82
29.8%
Servidor42
 
15.3%
Saude15
 
5.5%
Professor14
 
5.1%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
outras122
34.2%
do82
23.0%
lar82
23.0%
servidor42
 
11.8%
saude15
 
4.2%
professor14
 
3.9%

Most occurring characters

ValueCountFrequency (%)
r316
17.9%
a219
12.4%
o152
8.6%
s150
8.5%
u137
7.8%
O122
 
6.9%
t122
 
6.9%
D82
 
4.7%
82
 
4.7%
L82
 
4.7%
Other values (7)297
16.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter1322
75.1%
Uppercase Letter357
 
20.3%
Space Separator82
 
4.7%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
r316
23.9%
a219
16.6%
o152
11.5%
s150
11.3%
u137
10.4%
t122
 
9.2%
e71
 
5.4%
d57
 
4.3%
v42
 
3.2%
i42
 
3.2%
Uppercase Letter
ValueCountFrequency (%)
O122
34.2%
D82
23.0%
L82
23.0%
S57
16.0%
P14
 
3.9%
Space Separator
ValueCountFrequency (%)
82
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin1679
95.3%
Common82
 
4.7%

Most frequent character per script

Latin
ValueCountFrequency (%)
r316
18.8%
a219
13.0%
o152
9.1%
s150
8.9%
u137
8.2%
O122
 
7.3%
t122
 
7.3%
D82
 
4.9%
L82
 
4.9%
e71
 
4.2%
Other values (6)226
13.5%
Common
ValueCountFrequency (%)
82
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII1761
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
r316
17.9%
a219
12.4%
o152
8.6%
s150
8.5%
u137
7.8%
O122
 
6.9%
t122
 
6.9%
D82
 
4.7%
82
 
4.7%
L82
 
4.7%
Other values (7)297
16.9%

OcupacaoPai
Categorical

HIGH CORRELATION

Distinct5
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
Outras
160 
Servidor
76 
Do Lar
26 
Professor
 
7
Saude
 
6

Length

Max length9
Median length6
Mean length6.607272727
Min length5

Characters and Unicode

Total characters1817
Distinct characters17
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowOutras
2nd rowOutras
3rd rowOutras
4th rowOutras
5th rowOutras

Common Values

ValueCountFrequency (%)
Outras160
58.2%
Servidor76
27.6%
Do Lar26
 
9.5%
Professor7
 
2.5%
Saude6
 
2.2%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
outras160
53.2%
servidor76
25.2%
do26
 
8.6%
lar26
 
8.6%
professor7
 
2.3%
saude6
 
2.0%

Most occurring characters

ValueCountFrequency (%)
r352
19.4%
a192
10.6%
s174
9.6%
u166
9.1%
O160
8.8%
t160
8.8%
o116
 
6.4%
e89
 
4.9%
S82
 
4.5%
d82
 
4.5%
Other values (7)244
13.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter1490
82.0%
Uppercase Letter301
 
16.6%
Space Separator26
 
1.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
r352
23.6%
a192
12.9%
s174
11.7%
u166
11.1%
t160
10.7%
o116
 
7.8%
e89
 
6.0%
d82
 
5.5%
v76
 
5.1%
i76
 
5.1%
Uppercase Letter
ValueCountFrequency (%)
O160
53.2%
S82
27.2%
D26
 
8.6%
L26
 
8.6%
P7
 
2.3%
Space Separator
ValueCountFrequency (%)
26
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin1791
98.6%
Common26
 
1.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
r352
19.7%
a192
10.7%
s174
9.7%
u166
9.3%
O160
8.9%
t160
8.9%
o116
 
6.5%
e89
 
5.0%
S82
 
4.6%
d82
 
4.6%
Other values (6)218
12.2%
Common
ValueCountFrequency (%)
26
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII1817
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
r352
19.4%
a192
10.6%
s174
9.6%
u166
9.1%
O160
8.8%
t160
8.8%
o116
 
6.4%
e89
 
4.9%
S82
 
4.5%
d82
 
4.5%
Other values (7)244
13.4%

MotivoEscolha
Categorical

Distinct: 4
Distinct (%): 1.5%
Missing: 0
Missing (%): 0.0%
Memory size: 2.3 KiB
Curso: 147
Reputação: 45
Distancia: 45
Outros: 38

Length

Max length: 9
Median length: 5
Mean length: 6.447272727
Min length: 5

Characters and Unicode

Total characters: 1773
Distinct characters: 17
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
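
MotivoEscolha is the first variable here that spans two Unicode blocks, because "Reputação" contains ç and ã. The sketch below assigns each character to a block by its code point range. The `unicode_block` helper is hypothetical and covers only the two blocks seen in this column, and it assumes the accented letters are stored as single precomposed code points:

```python
from collections import Counter

# Value counts for MotivoEscolha as reported in this section.
# The escapes spell "Reputação" with precomposed ç (U+00E7) and ã (U+00E3).
motivo_counts = {
    "Curso": 147,
    "Reputa\u00e7\u00e3o": 45,
    "Distancia": 45,
    "Outros": 38,
}


def unicode_block(ch: str) -> str:
    """Hypothetical helper: classify a character into the blocks seen here."""
    code_point = ord(ch)
    if code_point <= 0x7F:
        return "ASCII"  # Basic Latin, U+0000..U+007F
    if code_point <= 0xFF:
        return "Latin 1 Sup"  # Latin-1 Supplement, U+0080..U+00FF
    return "Other"


block_totals = Counter()
for value, count in motivo_counts.items():
    for ch in value:
        block_totals[unicode_block(ch)] += count

print(dict(block_totals), sum(block_totals.values()))
```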

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowCurso
2nd rowReputação
3rd rowCurso
4th rowCurso
5th rowDistancia

Common Values

ValueCountFrequency (%)
Curso147
53.5%
Reputação45
 
16.4%
Distancia45
 
16.4%
Outros38
 
13.8%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
curso147
53.5%
distancia45
 
16.4%
reputação45
 
16.4%
outros38
 
13.8%

Most occurring characters

ValueCountFrequency (%)
u230
13.0%
s230
13.0%
o230
13.0%
r185
10.4%
C147
8.3%
a135
7.6%
t128
7.2%
i90
 
5.1%
R45
 
2.5%
e45
 
2.5%
Other values (7)308
17.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter1498
84.5%
Uppercase Letter275
 
15.5%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
u230
15.4%
s230
15.4%
o230
15.4%
r185
12.3%
a135
9.0%
t128
8.5%
i90
 
6.0%
e45
 
3.0%
p45
 
3.0%
ç45
 
3.0%
Other values (3)135
9.0%
Uppercase Letter
ValueCountFrequency (%)
C147
53.5%
R45
 
16.4%
D45
 
16.4%
O38
 
13.8%

Most occurring scripts

ValueCountFrequency (%)
Latin1773
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
u230
13.0%
s230
13.0%
o230
13.0%
r185
10.4%
C147
8.3%
a135
7.6%
t128
7.2%
i90
 
5.1%
R45
 
2.5%
e45
 
2.5%
Other values (7)308
17.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII1683
94.9%
Latin 1 Sup90
 
5.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
u230
13.7%
s230
13.7%
o230
13.7%
r185
11.0%
C147
8.7%
a135
8.0%
t128
7.6%
i90
 
5.3%
R45
 
2.7%
e45
 
2.7%
Other values (5)218
13.0%
Latin 1 Sup
ValueCountFrequency (%)
ç45
50.0%
ã45
50.0%

FezMaternal
Categorical

Distinct2
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
Sim
219 
Não
56 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters825
Distinct characters6
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNão
2nd rowSim
3rd rowSim
4th rowSim
5th rowSim

Common Values

ValueCountFrequency (%)
Sim219
79.6%
Não56
 
20.4%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
sim219
79.6%
não56
 
20.4%

Most occurring characters

ValueCountFrequency (%)
S219
26.5%
i219
26.5%
m219
26.5%
N56
 
6.8%
ã56
 
6.8%
o56
 
6.8%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter550
66.7%
Uppercase Letter275
33.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i219
39.8%
m219
39.8%
ã56
 
10.2%
o56
 
10.2%
Uppercase Letter
ValueCountFrequency (%)
S219
79.6%
N56
 
20.4%

Most occurring scripts

ValueCountFrequency (%)
Latin825
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
S219
26.5%
i219
26.5%
m219
26.5%
N56
 
6.8%
ã56
 
6.8%
o56
 
6.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII769
93.2%
Latin 1 Sup56
 
6.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
S219
28.5%
i219
28.5%
m219
28.5%
N56
 
7.3%
o56
 
7.3%
Latin 1 Sup
ValueCountFrequency (%)
ã56
100.0%

Internet
Categorical

Distinct2
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
Sim
182 
Não
93 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters825
Distinct characters6
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowSim
2nd rowSim
3rd rowSim
4th rowNão
5th rowNão

Common Values

ValueCountFrequency (%)
Sim182
66.2%
Não93
33.8%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
sim182
66.2%
não93
33.8%

Most occurring characters

ValueCountFrequency (%)
S182
22.1%
i182
22.1%
m182
22.1%
N93
11.3%
ã93
11.3%
o93
11.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter550
66.7%
Uppercase Letter275
33.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i182
33.1%
m182
33.1%
ã93
16.9%
o93
16.9%
Uppercase Letter
ValueCountFrequency (%)
S182
66.2%
N93
33.8%

Most occurring scripts

ValueCountFrequency (%)
Latin825
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
S182
22.1%
i182
22.1%
m182
22.1%
N93
11.3%
ã93
11.3%
o93
11.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII732
88.7%
Latin 1 Sup93
 
11.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
S182
24.9%
i182
24.9%
m182
24.9%
N93
12.7%
o93
12.7%
Latin 1 Sup
ValueCountFrequency (%)
ã93
100.0%

Guarda_Pt
Categorical

Distinct3
Distinct (%)1.1%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
Mãe
185 
Pai
65 
Outro
25 

Length

Max length5
Median length3
Mean length3.181818182
Min length3

Characters and Unicode

Total characters875
Distinct characters11
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowMãe
2nd rowMãe
3rd rowMãe
4th rowMãe
5th rowMãe

Common Values

ValueCountFrequency (%)
Mãe185
67.3%
Pai65
 
23.6%
Outro25
 
9.1%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
mãe185
67.3%
pai65
 
23.6%
outro25
 
9.1%

Most occurring characters

ValueCountFrequency (%)
M185
21.1%
ã185
21.1%
e185
21.1%
P65
 
7.4%
a65
 
7.4%
i65
 
7.4%
O25
 
2.9%
u25
 
2.9%
t25
 
2.9%
r25
 
2.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter600
68.6%
Uppercase Letter275
31.4%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
ã185
30.8%
e185
30.8%
a65
 
10.8%
i65
 
10.8%
u25
 
4.2%
t25
 
4.2%
r25
 
4.2%
o25
 
4.2%
Uppercase Letter
ValueCountFrequency (%)
M185
67.3%
P65
 
23.6%
O25
 
9.1%

Most occurring scripts

ValueCountFrequency (%)
Latin875
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
M185
21.1%
ã185
21.1%
e185
21.1%
P65
 
7.4%
a65
 
7.4%
i65
 
7.4%
O25
 
2.9%
u25
 
2.9%
t25
 
2.9%
r25
 
2.9%

Most occurring blocks

ValueCountFrequency (%)
ASCII690
78.9%
Latin 1 Sup185
 
21.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
M185
26.8%
e185
26.8%
P65
 
9.4%
a65
 
9.4%
i65
 
9.4%
O25
 
3.6%
u25
 
3.6%
t25
 
3.6%
r25
 
3.6%
o25
 
3.6%
Latin 1 Sup
ValueCountFrequency (%)
ã185
100.0%
Distinct4
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
1.0
122 
2.0
113 
3.0
32 
4.0
 
8

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters825
Distinct characters6
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2.0
2nd row1.0
3rd row3.0
4th row4.0
5th row1.0

Common Values

ValueCountFrequency (%)
1.0122
44.4%
2.0113
41.1%
3.032
 
11.6%
4.08
 
2.9%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
1.0122
44.4%
2.0113
41.1%
3.032
 
11.6%
4.08
 
2.9%

Most occurring characters

ValueCountFrequency (%)
.275
33.3%
0275
33.3%
1122
14.8%
2113
13.7%
332
 
3.9%
48
 
1.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number550
66.7%
Other Punctuation275
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0275
50.0%
1122
22.2%
2113
20.5%
332
 
5.8%
48
 
1.5%
Other Punctuation
ValueCountFrequency (%)
.275
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common825
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.275
33.3%
0275
33.3%
1122
14.8%
2113
13.7%
332
 
3.9%
48
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII825
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.275
33.3%
0275
33.3%
1122
14.8%
2113
13.7%
332
 
3.9%
48
 
1.0%
Distinct4
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
2.0
118 
1.0
114 
3.0
35 
4.0
 
8

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters825
Distinct characters6
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3.0
2nd row2.0
3rd row1.0
4th row1.0
5th row1.0

Common Values

ValueCountFrequency (%)
2.0118
42.9%
1.0114
41.5%
3.035
 
12.7%
4.08
 
2.9%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
2.0118
42.9%
1.0114
41.5%
3.035
 
12.7%
4.08
 
2.9%

Most occurring characters

ValueCountFrequency (%)
.275
33.3%
0275
33.3%
2118
14.3%
1114
13.8%
335
 
4.2%
48
 
1.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number550
66.7%
Other Punctuation275
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0275
50.0%
2118
21.5%
1114
20.7%
335
 
6.4%
48
 
1.5%
Other Punctuation
ValueCountFrequency (%)
.275
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common825
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.275
33.3%
0275
33.3%
2118
14.3%
1114
13.8%
335
 
4.2%
48
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII825
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.275
33.3%
0275
33.3%
2118
14.3%
1114
13.8%
335
 
4.2%
48
 
1.0%

Reprovacoes_Pt
Categorical

HIGH CORRELATION

Distinct4
Distinct (%)1.5%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
0.0
208 
1.0
49 
2.0
 
10
3.0
 
8

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters825
Distinct characters5
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0.0
2nd row3.0
3rd row0.0
4th row0.0
5th row1.0

Common Values

ValueCountFrequency (%)
0.0208
75.6%
1.049
 
17.8%
2.010
 
3.6%
3.08
 
2.9%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
0.0208
75.6%
1.049
 
17.8%
2.010
 
3.6%
3.08
 
2.9%

Most occurring characters

ValueCountFrequency (%)
0483
58.5%
.275
33.3%
149
 
5.9%
210
 
1.2%
38
 
1.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number550
66.7%
Other Punctuation275
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0483
87.8%
149
 
8.9%
210
 
1.8%
38
 
1.5%
Other Punctuation
ValueCountFrequency (%)
.275
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common825
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0483
58.5%
.275
33.3%
149
 
5.9%
210
 
1.2%
38
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII825
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0483
58.5%
.275
33.3%
149
 
5.9%
210
 
1.2%
38
 
1.0%

AjudaExtraEscola_Pt
Categorical

Distinct2
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
Não
256 
Sim
 
19

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters825
Distinct characters6
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNão
2nd rowNão
3rd rowNão
4th rowNão
5th rowNão

Common Values

ValueCountFrequency (%)
Não256
93.1%
Sim19
 
6.9%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
não256
93.1%
sim19
 
6.9%

Most occurring characters

ValueCountFrequency (%)
N256
31.0%
ã256
31.0%
o256
31.0%
S19
 
2.3%
i19
 
2.3%
m19
 
2.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter550
66.7%
Uppercase Letter275
33.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
ã256
46.5%
o256
46.5%
i19
 
3.5%
m19
 
3.5%
Uppercase Letter
ValueCountFrequency (%)
N256
93.1%
S19
 
6.9%

Most occurring scripts

ValueCountFrequency (%)
Latin825
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
N256
31.0%
ã256
31.0%
o256
31.0%
S19
 
2.3%
i19
 
2.3%
m19
 
2.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII569
69.0%
Latin 1 Sup256
31.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
N256
45.0%
o256
45.0%
S19
 
3.3%
i19
 
3.3%
m19
 
3.3%
Latin 1 Sup
ValueCountFrequency (%)
ã256
100.0%

AjudaEduFamiliar_Pt
Categorical

Distinct2
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
Sim
165 
Não
110 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters825
Distinct characters6
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowSim
2nd rowSim
3rd rowSim
4th rowNão
5th rowNão

Common Values

ValueCountFrequency (%)
Sim165
60.0%
Não110
40.0%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
sim165
60.0%
não110
40.0%

Most occurring characters

ValueCountFrequency (%)
S165
20.0%
i165
20.0%
m165
20.0%
N110
13.3%
ã110
13.3%
o110
13.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter550
66.7%
Uppercase Letter275
33.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i165
30.0%
m165
30.0%
ã110
20.0%
o110
20.0%
Uppercase Letter
ValueCountFrequency (%)
S165
60.0%
N110
40.0%

Most occurring scripts

ValueCountFrequency (%)
Latin825
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
S165
20.0%
i165
20.0%
m165
20.0%
N110
13.3%
ã110
13.3%
o110
13.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII715
86.7%
Latin 1 Sup110
 
13.3%

Most frequent character per block

ASCII
ValueCountFrequency (%)
S165
23.1%
i165
23.1%
m165
23.1%
N110
15.4%
o110
15.4%
Latin 1 Sup
ValueCountFrequency (%)
ã110
100.0%

AulasParticulares_Pt
Categorical

Distinct2
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
Não
262 
Sim
 
13

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters825
Distinct characters6
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNão
2nd rowNão
3rd rowNão
4th rowNão
5th rowNão

Common Values

ValueCountFrequency (%)
Não262
95.3%
Sim13
 
4.7%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
não262
95.3%
sim13
 
4.7%

Most occurring characters

ValueCountFrequency (%)
N262
31.8%
ã262
31.8%
o262
31.8%
S13
 
1.6%
i13
 
1.6%
m13
 
1.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter550
66.7%
Uppercase Letter275
33.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
ã262
47.6%
o262
47.6%
i13
 
2.4%
m13
 
2.4%
Uppercase Letter
ValueCountFrequency (%)
N262
95.3%
S13
 
4.7%

Most occurring scripts

ValueCountFrequency (%)
Latin825
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
N262
31.8%
ã262
31.8%
o262
31.8%
S13
 
1.6%
i13
 
1.6%
m13
 
1.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII563
68.2%
Latin 1 Sup262
31.8%

Most frequent character per block

ASCII
ValueCountFrequency (%)
N262
46.5%
o262
46.5%
S13
 
2.3%
i13
 
2.3%
m13
 
2.3%
Latin 1 Sup
ValueCountFrequency (%)
ã262
100.0%

AtividadesExtra_Pt
Categorical

Distinct2
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
Não
154 
Sim
121 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters825
Distinct characters6
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowNão
2nd rowSim
3rd rowNão
4th rowNão
5th rowNão

Common Values

ValueCountFrequency (%)
Não154
56.0%
Sim121
44.0%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
não154
56.0%
sim121
44.0%

Most occurring characters

ValueCountFrequency (%)
N154
18.7%
ã154
18.7%
o154
18.7%
S121
14.7%
i121
14.7%
m121
14.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter550
66.7%
Uppercase Letter275
33.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
ã154
28.0%
o154
28.0%
i121
22.0%
m121
22.0%
Uppercase Letter
ValueCountFrequency (%)
N154
56.0%
S121
44.0%

Most occurring scripts

ValueCountFrequency (%)
Latin825
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
N154
18.7%
ã154
18.7%
o154
18.7%
S121
14.7%
i121
14.7%
m121
14.7%

Most occurring blocks

ValueCountFrequency (%)
ASCII671
81.3%
Latin 1 Sup154
 
18.7%

Most frequent character per block

ASCII
ValueCountFrequency (%)
N154
23.0%
o154
23.0%
S121
18.0%
i121
18.0%
m121
18.0%
Latin 1 Sup
ValueCountFrequency (%)
ã154
100.0%

QuerFaculdade_Pt
Categorical

Distinct2
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
Sim
222 
Não
53 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters825
Distinct characters6
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowSim
2nd rowNão
3rd rowSim
4th rowSim
5th rowSim

Common Values

ValueCountFrequency (%)
Sim222
80.7%
Não53
 
19.3%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
sim222
80.7%
não53
 
19.3%

Most occurring characters

ValueCountFrequency (%)
S222
26.9%
i222
26.9%
m222
26.9%
N53
 
6.4%
ã53
 
6.4%
o53
 
6.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter550
66.7%
Uppercase Letter275
33.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
i222
40.4%
m222
40.4%
ã53
 
9.6%
o53
 
9.6%
Uppercase Letter
ValueCountFrequency (%)
S222
80.7%
N53
 
19.3%

Most occurring scripts

ValueCountFrequency (%)
Latin825
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
S222
26.9%
i222
26.9%
m222
26.9%
N53
 
6.4%
ã53
 
6.4%
o53
 
6.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII772
93.6%
Latin 1 Sup53
 
6.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
S222
28.8%
i222
28.8%
m222
28.8%
N53
 
6.9%
o53
 
6.9%
Latin 1 Sup
ValueCountFrequency (%)
ã53
100.0%

EmUmRelacionamento_Pt
Categorical

Distinct2
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
Não
157 
Sim
118 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters825
Distinct characters6
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowSim
2nd rowSim
3rd rowNão
4th rowNão
5th rowNão

Common Values

ValueCountFrequency (%)
Não157
57.1%
Sim118
42.9%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
não157
57.1%
sim118
42.9%

Most occurring characters

ValueCountFrequency (%)
N157
19.0%
ã157
19.0%
o157
19.0%
S118
14.3%
i118
14.3%
m118
14.3%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter550
66.7%
Uppercase Letter275
33.3%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
ã157
28.5%
o157
28.5%
i118
21.5%
m118
21.5%
Uppercase Letter
ValueCountFrequency (%)
N157
57.1%
S118
42.9%

Most occurring scripts

ValueCountFrequency (%)
Latin825
100.0%

Most frequent character per script

Latin
ValueCountFrequency (%)
N157
19.0%
ã157
19.0%
o157
19.0%
S118
14.3%
i118
14.3%
m118
14.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII668
81.0%
Latin 1 Sup157
 
19.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
N157
23.5%
o157
23.5%
S118
17.7%
i118
17.7%
m118
17.7%
Latin 1 Sup
ValueCountFrequency (%)
ã157
100.0%

RelacaoFamiliar_Pt
Categorical

Distinct5
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
4.0
136 
5.0
78 
3.0
36 
1.0
14 
2.0
 
11

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters825
Distinct characters7
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3.0
2nd row5.0
3rd row3.0
4th row3.0
5th row5.0

Common Values

ValueCountFrequency (%)
4.0136
49.5%
5.078
28.4%
3.036
 
13.1%
1.014
 
5.1%
2.011
 
4.0%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
4.0136
49.5%
5.078
28.4%
3.036
 
13.1%
1.014
 
5.1%
2.011
 
4.0%

Most occurring characters

ValueCountFrequency (%)
.275
33.3%
0275
33.3%
4136
16.5%
578
 
9.5%
336
 
4.4%
114
 
1.7%
211
 
1.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number550
66.7%
Other Punctuation275
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0275
50.0%
4136
24.7%
578
 
14.2%
336
 
6.5%
114
 
2.5%
211
 
2.0%
Other Punctuation
ValueCountFrequency (%)
.275
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common825
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.275
33.3%
0275
33.3%
4136
16.5%
578
 
9.5%
336
 
4.4%
114
 
1.7%
211
 
1.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII825
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.275
33.3%
0275
33.3%
4136
16.5%
578
 
9.5%
336
 
4.4%
114
 
1.7%
211
 
1.3%

TempoLivre_Pt
Categorical

HIGH CORRELATION

Distinct5
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
3.0
97 
4.0
72 
2.0
47 
5.0
31 
1.0
28 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters825
Distinct characters7
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2.0
2nd row4.0
3rd row2.0
4th row2.0
5th row3.0

Common Values

ValueCountFrequency (%)
3.097
35.3%
4.072
26.2%
2.047
17.1%
5.031
 
11.3%
1.028
 
10.2%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
3.097
35.3%
4.072
26.2%
2.047
17.1%
5.031
 
11.3%
1.028
 
10.2%

Most occurring characters

ValueCountFrequency (%)
.275
33.3%
0275
33.3%
397
 
11.8%
472
 
8.7%
247
 
5.7%
531
 
3.8%
128
 
3.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number550
66.7%
Other Punctuation275
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0275
50.0%
397
 
17.6%
472
 
13.1%
247
 
8.5%
531
 
5.6%
128
 
5.1%
Other Punctuation
ValueCountFrequency (%)
.275
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common825
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.275
33.3%
0275
33.3%
397
 
11.8%
472
 
8.7%
247
 
5.7%
531
 
3.8%
128
 
3.4%

Most occurring blocks

ValueCountFrequency (%)
ASCII825
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.275
33.3%
0275
33.3%
397
 
11.8%
472
 
8.7%
247
 
5.7%
531
 
3.8%
128
 
3.4%

SaiComAmigos_Pt
Categorical

HIGH CORRELATION

Distinct5
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
3.0
84 
4.0
62 
5.0
56 
2.0
48 
1.0
25 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters825
Distinct characters7
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3.0
2nd row5.0
3rd row2.0
4th row1.0
5th row4.0

Common Values

ValueCountFrequency (%)
3.084
30.5%
4.062
22.5%
5.056
20.4%
2.048
17.5%
1.025
 
9.1%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
3.084
30.5%
4.062
22.5%
5.056
20.4%
2.048
17.5%
1.025
 
9.1%

Most occurring characters

ValueCountFrequency (%)
.275
33.3%
0275
33.3%
384
 
10.2%
462
 
7.5%
556
 
6.8%
248
 
5.8%
125
 
3.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number550
66.7%
Other Punctuation275
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0275
50.0%
384
 
15.3%
462
 
11.3%
556
 
10.2%
248
 
8.7%
125
 
4.5%
Other Punctuation
ValueCountFrequency (%)
.275
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common825
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.275
33.3%
0275
33.3%
384
 
10.2%
462
 
7.5%
556
 
6.8%
248
 
5.8%
125
 
3.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII825
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.275
33.3%
0275
33.3%
384
 
10.2%
462
 
7.5%
556
 
6.8%
248
 
5.8%
125
 
3.0%

ConsumoAlcoolDiaUtil_Pt
Categorical

HIGH CORRELATION

Distinct5
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
1.0
190 
2.0
49 
3.0
 
19
4.0
 
9
5.0
 
8

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters825
Distinct characters7
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2.0
2nd row1.0
3rd row1.0
4th row1.0
5th row1.0

Common Values

ValueCountFrequency (%)
1.0190
69.1%
2.049
 
17.8%
3.019
 
6.9%
4.09
 
3.3%
5.08
 
2.9%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
1.0190
69.1%
2.049
 
17.8%
3.019
 
6.9%
4.09
 
3.3%
5.08
 
2.9%

Most occurring characters

ValueCountFrequency (%)
.275
33.3%
0275
33.3%
1190
23.0%
249
 
5.9%
319
 
2.3%
49
 
1.1%
58
 
1.0%

Most occurring categories

ValueCountFrequency (%)
Decimal Number550
66.7%
Other Punctuation275
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0275
50.0%
1190
34.5%
249
 
8.9%
319
 
3.5%
49
 
1.6%
58
 
1.5%
Other Punctuation
ValueCountFrequency (%)
.275
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common825
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.275
33.3%
0275
33.3%
1190
23.0%
249
 
5.9%
319
 
2.3%
49
 
1.1%
58
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII825
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.275
33.3%
0275
33.3%
1190
23.0%
249
 
5.9%
319
 
2.3%
49
 
1.1%
58
 
1.0%

ConsumoAlcoolFimSemana_Pt
Categorical

HIGH CORRELATION

Distinct5
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
1.0
106 
2.0
68 
3.0
44 
4.0
39 
5.0
18 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters825
Distinct characters7
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2.0
2nd row3.0
3rd row2.0
4th row1.0
5th row1.0

Common Values

ValueCountFrequency (%)
1.0106
38.5%
2.068
24.7%
3.044
16.0%
4.039
 
14.2%
5.018
 
6.5%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
1.0106
38.5%
2.068
24.7%
3.044
16.0%
4.039
 
14.2%
5.018
 
6.5%

Most occurring characters

ValueCountFrequency (%)
.275
33.3%
0275
33.3%
1106
 
12.8%
268
 
8.2%
344
 
5.3%
439
 
4.7%
518
 
2.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number550
66.7%
Other Punctuation275
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0275
50.0%
1106
 
19.3%
268
 
12.4%
344
 
8.0%
439
 
7.1%
518
 
3.3%
Other Punctuation
ValueCountFrequency (%)
.275
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common825
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.275
33.3%
0275
33.3%
1106
 
12.8%
268
 
8.2%
344
 
5.3%
439
 
4.7%
518
 
2.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII825
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.275
33.3%
0275
33.3%
1106
 
12.8%
268
 
8.2%
344
 
5.3%
439
 
4.7%
518
 
2.2%

SitSaude_Pt
Categorical

Distinct5
Distinct (%)1.8%
Missing0
Missing (%)0.0%
Memory size2.3 KiB
5.0
106 
4.0
46 
1.0
44 
3.0
44 
2.0
35 

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters825
Distinct characters7
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1.0
2nd row5.0
3rd row5.0
4th row2.0
5th row5.0

Common Values

ValueCountFrequency (%)
5.0106
38.5%
4.046
16.7%
1.044
16.0%
3.044
16.0%
2.035
 
12.7%

Length

Histogram of lengths of the category

Pie chart

ValueCountFrequency (%)
5.0106
38.5%
4.046
16.7%
1.044
16.0%
3.044
16.0%
2.035
 
12.7%

Most occurring characters

ValueCountFrequency (%)
.275
33.3%
0275
33.3%
5106
 
12.8%
446
 
5.6%
144
 
5.3%
344
 
5.3%
235
 
4.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number550
66.7%
Other Punctuation275
33.3%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0275
50.0%
5106
 
19.3%
446
 
8.4%
144
 
8.0%
344
 
8.0%
235
 
6.4%
Other Punctuation
ValueCountFrequency (%)
.275
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common825
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
.275
33.3%
0275
33.3%
5106
 
12.8%
446
 
5.6%
144
 
5.3%
344
 
5.3%
235
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII825
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
.275
33.3%
0275
33.3%
5106
 
12.8%
446
 
5.6%
144
 
5.3%
344
 
5.3%
235
 
4.2%

Faltas_Pt
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct: 20
Distinct (%): 7.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.730909091
Minimum: 0
Maximum: 26
Zeros: 97
Zeros (%): 35.3%
Negative: 0
Negative (%): 0.0%
Memory size: 2.3 KiB

Quantile statistics

Minimum: 0
5th percentile: 0
Q1: 0
Median: 2
Q3: 6
95th percentile: 11.3
Maximum: 26
Range: 26
Interquartile range (IQR): 6

Descriptive statistics

Standard deviation: 4.408058013
Coefficient of variation (CV): 1.181497031
Kurtosis: 3.96585362
Mean: 3.730909091
Median Absolute Deviation (MAD): 2
Skewness: 1.682934482
Sum: 1026
Variance: 19.43097545
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=20)

Common values

Value | Count | Frequency (%)
0 | 97 | 35.3%
2 | 45 | 16.4%
4 | 33 | 12.0%
8 | 23 | 8.4%
6 | 13 | 4.7%
10 | 12 | 4.4%
5 | 12 | 4.4%
1 | 9 | 3.3%
9 | 6 | 2.2%
3 | 5 | 1.8%
Other values (10) | 20 | 7.3%

Minimum 10 values

Value | Count | Frequency (%)
0 | 97 | 35.3%
1 | 9 | 3.3%
2 | 45 | 16.4%
3 | 5 | 1.8%
4 | 33 | 12.0%
5 | 12 | 4.4%
6 | 13 | 4.7%
7 | 1 | 0.4%
8 | 23 | 8.4%
9 | 6 | 2.2%

Maximum 10 values

Value | Count | Frequency (%)
26 | 1 | 0.4%
24 | 1 | 0.4%
21 | 1 | 0.4%
18 | 1 | 0.4%
16 | 2 | 0.7%
14 | 2 | 0.7%
13 | 1 | 0.4%
12 | 5 | 1.8%
11 | 5 | 1.8%
10 | 12 | 4.4%
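
The frequency tables for Faltas_Pt list all 20 distinct values between them, so the reported summary statistics can be re-derived from the counts alone. The following is a minimal sketch using only Python's standard library. The names `faltas_counts` and `values` are illustrative and not part of the report, and the sketch assumes the report uses the sample (n - 1) variance, which the reported figures are consistent with.

```python
import statistics

# Value -> count for Faltas_Pt, copied from the report's frequency tables.
faltas_counts = {
    0: 97, 1: 9, 2: 45, 3: 5, 4: 33, 5: 12, 6: 13, 7: 1, 8: 23, 9: 6,
    10: 12, 11: 5, 12: 5, 13: 1, 14: 2, 16: 2, 18: 1, 21: 1, 24: 1, 26: 1,
}

# Expand the counts back into one value per observation.
values = [value for value, count in faltas_counts.items() for _ in range(count)]

n = len(values)                          # number of observations
total = sum(values)                      # "Sum" in the report
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance (divides by n - 1)
std_dev = statistics.stdev(values)
cv = std_dev / mean                      # coefficient of variation
median = statistics.median(values)
zeros_pct = 100 * faltas_counts[0] / n   # share of zero values, in percent

print(n, total, mean, variance, std_dev, cv, median, zeros_pct)
```

Comparing the printed values with the Sum, Mean, Variance, Standard deviation, CV, Median and Zeros (%) entries of the report is a quick consistency check on the table.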

NotaP1_Pt
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 15
Distinct (%): 5.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 10.40363636
Minimum: 4
Maximum: 18
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.3 KiB

Quantile statistics

Minimum: 4
5th percentile: 6
Q1: 9
Median: 10
Q3: 12
95th percentile: 15
Maximum: 18
Range: 14
Interquartile range (IQR): 3

Descriptive statistics

Standard deviation: 2.680148952
Coefficient of variation (CV): 0.2576165543
Kurtosis: -0.05357241776
Mean: 10.40363636
Median Absolute Deviation (MAD): 2
Skewness: 0.3199744573
Sum: 2861
Variance: 7.183198407
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=15)

Common values

Value | Count | Frequency (%)
10 | 45 | 16.4%
11 | 41 | 14.9%
9 | 38 | 13.8%
8 | 29 | 10.5%
12 | 25 | 9.1%
7 | 23 | 8.4%
14 | 21 | 7.6%
13 | 20 | 7.3%
6 | 9 | 3.3%
15 | 7 | 2.5%
Other values (5) | 17 | 6.2%

Minimum 10 values

Value | Count | Frequency (%)
4 | 2 | 0.7%
5 | 4 | 1.5%
6 | 9 | 3.3%
7 | 23 | 8.4%
8 | 29 | 10.5%
9 | 38 | 13.8%
10 | 45 | 16.4%
11 | 41 | 14.9%
12 | 25 | 9.1%
13 | 20 | 7.3%

Maximum 10 values

Value | Count | Frequency (%)
18 | 2 | 0.7%
17 | 4 | 1.5%
16 | 5 | 1.8%
15 | 7 | 2.5%
14 | 21 | 7.6%
13 | 20 | 7.3%
12 | 25 | 9.1%
11 | 41 | 14.9%
10 | 45 | 16.4%
9 | 38 | 13.8%

NotaP2_Pt
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct: 15
Distinct (%): 5.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 10.62909091
Minimum: 0
Maximum: 18
Zeros: 7
Zeros (%): 2.5%
Negative: 0
Negative (%): 0.0%
Memory size: 2.3 KiB

Quantile statistics

Minimum: 0
5th percentile: 6
Q1: 9
Median: 10
Q3: 12.5
95th percentile: 16
Maximum: 18
Range: 18
Interquartile range (IQR): 3.5

Descriptive statistics

Standard deviation: 3.203651699
Coefficient of variation (CV): 0.3014041112
Kurtosis: 1.849254242
Mean: 10.62909091
Median Absolute Deviation (MAD): 2
Skewness: -0.4267584494
Sum: 2923
Variance: 10.26338421
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=15)

Common values

Value | Count | Frequency (%)
10 | 46 | 16.7%
11 | 43 | 15.6%
9 | 39 | 14.2%
8 | 27 | 9.8%
12 | 23 | 8.4%
14 | 21 | 7.6%
13 | 20 | 7.3%
7 | 12 | 4.4%
15 | 9 | 3.3%
17 | 8 | 2.9%
Other values (5) | 27 | 9.8%

Minimum 10 values

Value | Count | Frequency (%)
0 | 7 | 2.5%
5 | 2 | 0.7%
6 | 7 | 2.5%
7 | 12 | 4.4%
8 | 27 | 9.8%
9 | 39 | 14.2%
10 | 46 | 16.7%
11 | 43 | 15.6%
12 | 23 | 8.4%
13 | 20 | 7.3%

Maximum 10 values

Value | Count | Frequency (%)
18 | 5 | 1.8%
17 | 8 | 2.9%
16 | 6 | 2.2%
15 | 9 | 3.3%
14 | 21 | 7.6%
13 | 20 | 7.3%
12 | 23 | 8.4%
11 | 43 | 15.6%
10 | 46 | 16.7%
9 | 39 | 14.2%

NotaP3_Pt
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct: 14
Distinct (%): 5.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 11.04
Minimum: 0
Maximum: 18
Zeros: 10
Zeros (%): 3.6%
Negative: 0
Negative (%): 0.0%
Memory size: 2.3 KiB

Quantile statistics

Minimum: 0
5th percentile: 7
Q1: 9
Median: 11
Q3: 13
95th percentile: 17
Maximum: 18
Range: 18
Interquartile range (IQR): 4

Descriptive statistics

Standard deviation: 3.403819237
Coefficient of variation (CV): 0.3083169599
Kurtosis: 2.290681549
Mean: 11.04
Median Absolute Deviation (MAD): 2
Skewness: -0.763301271
Sum: 3036
Variance: 11.5859854
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=14)

Common values

Value | Count | Frequency (%)
10 | 52 | 18.9%
11 | 50 | 18.2%
8 | 28 | 10.2%
9 | 24 | 8.7%
14 | 24 | 8.7%
13 | 21 | 7.6%
12 | 18 | 6.5%
15 | 16 | 5.8%
0 | 10 | 3.6%
17 | 10 | 3.6%
Other values (4) | 22 | 8.0%

Minimum 10 values

Value | Count | Frequency (%)
0 | 10 | 3.6%
6 | 1 | 0.4%
7 | 7 | 2.5%
8 | 28 | 10.2%
9 | 24 | 8.7%
10 | 52 | 18.9%
11 | 50 | 18.2%
12 | 18 | 6.5%
13 | 21 | 7.6%
14 | 24 | 8.7%

Maximum 10 values

Value | Count | Frequency (%)
18 | 6 | 2.2%
17 | 10 | 3.6%
16 | 8 | 2.9%
15 | 16 | 5.8%
14 | 24 | 8.7%
13 | 21 | 7.6%
12 | 18 | 6.5%
11 | 50 | 18.2%
10 | 52 | 18.9%
9 | 24 | 8.7%

MediaNotas_Pt
Real number (ℝ≥0)

HIGH CORRELATION

Distinct: 45
Distinct (%): 16.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 10.69090909
Minimum: 1.333333333
Maximum: 18
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.3 KiB

Quantile statistics

Minimum: 1.333333333
5th percentile: 6.666666667
Q1: 9
Median: 10.33333333
Q3: 12.5
95th percentile: 16
Maximum: 18
Range: 16.66666667
Interquartile range (IQR): 3.5

Descriptive statistics

Standard deviation: 2.954956896
Coefficient of variation (CV): 0.2763990294
Kurtosis: 0.8054841374
Mean: 10.69090909
Median Absolute Deviation (MAD): 1.666666667
Skewness: -0.133006766
Sum: 2940
Variance: 8.731770257
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=45)

Common values

Value | Count | Frequency (%)
10 | 21 | 7.6%
10.66666667 | 18 | 6.5%
8.666666667 | 16 | 5.8%
9.666666667 | 16 | 5.8%
10.33333333 | 15 | 5.5%
9 | 13 | 4.7%
12.33333333 | 13 | 4.7%
9.333333333 | 12 | 4.4%
11 | 11 | 4.0%
13 | 10 | 3.6%
Other values (35) | 130 | 47.3%

Minimum 10 values

Value | Count | Frequency (%)
1.333333333 | 1 | 0.4%
1.666666667 | 2 | 0.7%
2.333333333 | 2 | 0.7%
2.666666667 | 1 | 0.4%
3 | 1 | 0.4%
4.666666667 | 1 | 0.4%
5 | 1 | 0.4%
5.333333333 | 1 | 0.4%
5.666666667 | 2 | 0.7%
6.333333333 | 1 | 0.4%

Maximum 10 values

Value | Count | Frequency (%)
18 | 2 | 0.7%
17.66666667 | 2 | 0.7%
17.33333333 | 1 | 0.4%
17 | 2 | 0.7%
16.66666667 | 1 | 0.4%
16.33333333 | 3 | 1.1%
16 | 4 | 1.5%
15.66666667 | 1 | 0.4%
15.33333333 | 4 | 1.5%
15 | 3 | 1.1%

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and (positive) scale of the two variables, so for an exact linear relationship the steepness of the line does not affect r; only the direction of the slope determines its sign.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
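
As a minimal sketch of this calculation in plain Python (the function name `pearson_r` is illustrative and not something the report defines), the code below divides the summed cross-products by the square root of the product of the summed squares. The common 1/(n - 1) factors of the covariance and of the two standard deviations cancel, so they are omitted. The sample data are the NotaP1_Pt and NotaP2_Pt values of the first ten rows shown in the report's sample.

```python
import math

def pearson_r(x, y):
    """Pearson's r: covariance of x and y over the product of their standard deviations."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sums of cross-products and squares; the 1/(n - 1) factors cancel in the ratio.
    cov_sum = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    if ss_x == 0 or ss_y == 0:
        raise ValueError("r is undefined when one variable is constant")
    return cov_sum / math.sqrt(ss_x * ss_y)

# NotaP1_Pt and NotaP2_Pt for the first ten rows of the sample.
nota_p1 = [13, 10, 11, 9, 13, 14, 11, 12, 12, 8]
nota_p2 = [12, 9, 10, 9, 11, 13, 11, 11, 11, 9]

r = pearson_r(nota_p1, nota_p2)
# Shifting and positively rescaling one variable leaves r unchanged.
r_rescaled = pearson_r(nota_p1, [2 * v + 5 for v in nota_p2])
print(r, r_rescaled)
```

Ten rows are far too few to reproduce the report's full-sample correlations; the example only illustrates the formula and its invariance to location and scale.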

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at capturing nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
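
A minimal sketch of that recipe, assuming tied values receive the average of the ranks they span (a common convention; the report does not state which one it uses). The helper names `average_ranks`, `pearson_r` and `spearman_rho` are illustrative.

```python
import math

def average_ranks(values):
    """Rank values from 1 to n, giving tied values the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the group while the next sorted value equals the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared_rank = (start + end) / 2 + 1
        for pos in range(start, end + 1):
            ranks[order[pos]] = shared_rank
        start = end + 1
    return ranks

def pearson_r(x, y):
    """Pearson's r, used here on the rank variables."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov_sum = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return cov_sum / math.sqrt(ss_x * ss_y)

def spearman_rho(x, y):
    """Spearman's rho: Pearson's r computed on the ranks of x and y."""
    return pearson_r(average_ranks(x), average_ranks(y))

# A monotonic but nonlinear relationship: y = x cubed.
x = [1, 2, 3, 4, 5]
y_cubed = [v ** 3 for v in x]
print(spearman_rho(x, y_cubed), pearson_r(x, y_cubed))
```

On the cubic data the rank correlation is perfect while Pearson's r falls short of 1, which is the sense in which ρ handles nonlinear monotonic relationships better.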

Kendall's τ

Like Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
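
The pair-counting definition can be sketched directly, as below. This is the plain form described in the text (often called tau-a), in which tied pairs count as neither concordant nor discordant. Statistical libraries commonly default to a tie-adjusted variant (tau-b), so results on data with many ties may differ from the report's. The function name `kendall_tau_a` is illustrative.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    concordant = discordant = total_pairs = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        total_pairs += 1
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1      # both variables move in the same direction
        elif direction < 0:
            discordant += 1      # the variables move in opposite directions
        # direction == 0 means a tie in x or y: counted in neither group
    return (concordant - discordant) / total_pairs

print(kendall_tau_a([1, 2, 3], [1, 3, 2]))
```

In the printed example two of the three pairs are concordant and one is discordant, giving (2 - 1) / 3.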

Phik (φk)

Phik (φk) is a practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
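
A minimal sketch of the bias-corrected estimator, assuming the usual form of Bergsma's correction: the squared phi coefficient and the numbers of rows and columns are each adjusted by terms in 1/(n - 1) before the ratio is taken. The function name `cramers_v_corrected` is illustrative, no continuity correction is applied to the chi-squared statistic, and the report's own implementation may differ in such details.

```python
import math
from collections import Counter

def cramers_v_corrected(a, b):
    """Bias-corrected Cramér's V for two equally long sequences of category labels."""
    if len(a) != len(b):
        raise ValueError("a and b must have the same length")
    n = len(a)
    row_totals = Counter(a)
    col_totals = Counter(b)
    observed = Counter(zip(a, b))
    r, k = len(row_totals), len(col_totals)
    if n < 2 or r < 2 or k < 2:
        raise ValueError("need at least two observations and two categories per variable")

    # Pearson chi-squared statistic over every cell, including empty ones.
    chi2 = 0.0
    for row, row_n in row_totals.items():
        for col, col_n in col_totals.items():
            expected = row_n * col_n / n
            chi2 += (observed[(row, col)] - expected) ** 2 / expected

    phi2 = chi2 / n
    # Bias corrections for phi squared and for the table dimensions.
    phi2_corr = max(0.0, phi2 - (k - 1) * (r - 1) / (n - 1))
    r_corr = r - (r - 1) ** 2 / (n - 1)
    k_corr = k - (k - 1) ** 2 / (n - 1)
    denom = min(r_corr - 1, k_corr - 1)
    if denom <= 0:
        return float("nan")  # undefined, e.g. when every observation is its own category
    return math.sqrt(phi2_corr / denom)

# Two identical binary variables (perfect association) and two independent ones.
same = ["Sim"] * 50 + ["Não"] * 50
print(cramers_v_corrected(same, same))
print(cramers_v_corrected(["Sim", "Sim", "Não", "Não"] * 25, ["x", "y", "x", "y"] * 25))
```

The two calls cover the extremes of the range described above: identical variables and variables whose joint counts match independence exactly.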

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

First rows

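| | df_index | Escola | Genero | Idade | Endereco | TamFamilia | StatusPais | InstrucaoMae | InstrucaoPai | OcupacaoMae | OcupacaoPai | MotivoEscolha | FezMaternal | Internet | Guarda_Pt | TempoViagemEscola_Pt | TempoEstudoSemanal_Pt | Reprovacoes_Pt | AjudaExtraEscola_Pt | AjudaEduFamiliar_Pt | AulasParticulares_Pt | AtividadesExtra_Pt | QuerFaculdade_Pt | EmUmRelacionamento_Pt | RelacaoFamiliar_Pt | TempoLivre_Pt | SaiComAmigos_Pt | ConsumoAlcoolDiaUtil_Pt | ConsumoAlcoolFimSemana_Pt | SitSaude_Pt | Faltas_Pt | NotaP1_Pt | NotaP2_Pt | NotaP3_Pt | MediaNotas_Pt |
| 0 | 407 | Gabriel Pereira | Masculino | 16 | Urbano | GT3 | Juntos | Fundamental2 | EMedio | Outras | Outras | Curso | Não | Sim | Mãe | 2.0 | 3.0 | 0.0 | Não | Sim | Não | Não | Sim | Sim | 3.0 | 2.0 | 3.0 | 2.0 | 2.0 | 1.0 | 4.0 | 13.0 | 12.0 | 13.0 | 12.666667 |
| 1 | 408 | Gabriel Pereira | Feminino | 18 | Urbano | GT3 | Juntos | Fundamental2 | Fundamental1 | Servidor | Outras | Reputação | Sim | Sim | Mãe | 1.0 | 2.0 | 3.0 | Não | Sim | Não | Sim | Não | Sim | 5.0 | 4.0 | 5.0 | 1.0 | 3.0 | 5.0 | 10.0 | 10.0 | 9.0 | 8.0 | 9.000000 |
| 2 | 409 | Gabriel Pereira | Feminino | 17 | Urbano | LE3 | Separados | Fundamental2 | Fundamental1 | Outras | Outras | Curso | Sim | Sim | Mãe | 3.0 | 1.0 | 0.0 | Não | Sim | Não | Não | Sim | Não | 3.0 | 2.0 | 2.0 | 1.0 | 2.0 | 5.0 | 8.0 | 11.0 | 10.0 | 11.0 | 10.666667 |
| 3 | 410 | Gabriel Pereira | Masculino | 16 | Urbano | GT3 | Juntos | Fundamental2 | Fundamental1 | Do Lar | Outras | Curso | Sim | Não | Mãe | 4.0 | 1.0 | 0.0 | Não | Não | Não | Não | Sim | Não | 3.0 | 2.0 | 1.0 | 1.0 | 1.0 | 2.0 | 4.0 | 9.0 | 9.0 | 11.0 | 9.666667 |
| 4 | 411 | Gabriel Pereira | Feminino | 16 | Urbano | GT3 | Separados | Fundamental2 | Fundamental2 | Outras | Outras | Distancia | Sim | Não | Mãe | 1.0 | 1.0 | 1.0 | Não | Não | Não | Não | Sim | Não | 5.0 | 3.0 | 4.0 | 1.0 | 1.0 | 5.0 | 12.0 | 13.0 | 11.0 | 11.0 | 11.666667 |
| 5 | 412 | Gabriel Pereira | Feminino | 16 | Rural | GT3 | Juntos | Fundamental1 | Fundamental1 | Do Lar | Outras | Curso | Sim | Não | Mãe | 4.0 | 2.0 | 0.0 | Não | Sim | Não | Não | Sim | Não | 5.0 | 1.0 | 3.0 | 1.0 | 1.0 | 3.0 | 0.0 | 14.0 | 13.0 | 13.0 | 13.333333 |
| 6 | 413 | Gabriel Pereira | Masculino | 18 | Urbano | LE3 | Juntos | EMedio | Fundamental1 | Servidor | Servidor | Curso | Sim | Sim | Mãe | 2.0 | 1.0 | 0.0 | Não | Não | Não | Sim | Sim | Sim | 3.0 | 3.0 | 4.0 | 4.0 | 5.0 | 4.0 | 2.0 | 11.0 | 11.0 | 12.0 | 11.333333 |
| 7 | 414 | Gabriel Pereira | Feminino | 18 | Urbano | GT3 | Separados | EMedio | Fundamental2 | Outras | Servidor | Curso | Não | Sim | Outro | 1.0 | 3.0 | 0.0 | Não | Sim | Não | Sim | Sim | Sim | 4.0 | 3.0 | 3.0 | 5.0 | 1.0 | 5.0 | 10.0 | 12.0 | 11.0 | 11.0 | 11.333333 |
| 8 | 415 | Gabriel Pereira | Feminino | 16 | Rural | GT3 | Juntos | Fundamental1 | Fundamental1 | Outras | Servidor | Reputação | Sim | Não | Mãe | 2.0 | 1.0 | 0.0 | Não | Sim | Não | Sim | Sim | Sim | 3.0 | 3.0 | 3.0 | 1.0 | 2.0 | 1.0 | 8.0 | 12.0 | 11.0 | 11.0 | 11.333333 |
| 9 | 416 | Gabriel Pereira | Feminino | 15 | Rural | GT3 | Juntos | Fundamental1 | Fundamental1 | Outras | Outras | Curso | Sim | Sim | Mãe | 3.0 | 1.0 | 1.0 | Não | Não | Não | Sim | Sim | Sim | 5.0 | 5.0 | 5.0 | 1.0 | 1.0 | 1.0 | 2.0 | 8.0 | 9.0 | 9.0 | 8.666667 |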

Last rows

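(index) | df_index | Escola | Genero | Idade | Endereco | TamFamilia | StatusPais | InstrucaoMae | InstrucaoPai | OcupacaoMae | OcupacaoPai | MotivoEscolha | FezMaternal | Internet | Guarda_Pt | TempoViagemEscola_Pt | TempoEstudoSemanal_Pt | Reprovacoes_Pt | AjudaExtraEscola_Pt | AjudaEduFamiliar_Pt | AulasParticulares_Pt | AtividadesExtra_Pt | QuerFaculdade_Pt | EmUmRelacionamento_Pt | RelacaoFamiliar_Pt | TempoLivre_Pt | SaiComAmigos_Pt | ConsumoAlcoolDiaUtil_Pt | ConsumoAlcoolFimSemana_Pt | SitSaude_Pt | Faltas_Pt | NotaP1_Pt | NotaP2_Pt | NotaP3_Pt | MediaNotas_Pt
265 | 672 | Mousinho da Silveira | Feminino | 19 | Urbano | GT3 | Juntos | Fundamental1 | Fundamental1 | Do Lar | Servidor | Outros | Sim | Não | Pai | 2.0 | 1.0 | 1.0 | Não | Não | Não | Não | Não | Não | 5.0 | 5.0 | 5.0 | 2.0 | 3.0 | 2.0 | 0.0 | 5.0 | 0.0 | 0.0 | 1.666667
266 | 673 | Mousinho da Silveira | Feminino | 17 | Urbano | GT3 | Juntos | Superior | Fundamental2 | Professor | Outras | Curso | Sim | Sim | Pai | 2.0 | 4.0 | 0.0 | Não | Não | Não | Não | Sim | Sim | 4.0 | 2.0 | 3.0 | 3.0 | 1.0 | 5.0 | 0.0 | 18.0 | 18.0 | 18.0 | 18.000000
267 | 674 | Mousinho da Silveira | Feminino | 17 | Rural | LE3 | Separados | Fundamental2 | Fundamental1 | Servidor | Outras | Reputação | Sim | Sim | Mãe | 2.0 | 2.0 | 0.0 | Não | Não | Não | Sim | Sim | Sim | 5.0 | 3.0 | 3.0 | 1.0 | 2.0 | 2.0 | 5.0 | 11.0 | 11.0 | 12.0 | 11.333333
268 | 675 | Mousinho da Silveira | Feminino | 18 | Urbano | LE3 | Separados | Fundamental1 | Fundamental1 | Do Lar | Servidor | Curso | Sim | Não | Mãe | 1.0 | 2.0 | 0.0 | Não | Não | Não | Não | Sim | Sim | 5.0 | 2.0 | 3.0 | 1.0 | 2.0 | 3.0 | 2.0 | 8.0 | 10.0 | 11.0 | 9.666667
269 | 676 | Mousinho da Silveira | Feminino | 18 | Urbano | GT3 | Juntos | Fundamental1 | Fundamental2 | Do Lar | Do Lar | Curso | Sim | Não | Pai | 2.0 | 2.0 | 0.0 | Não | Sim | Não | Não | Não | Não | 4.0 | 1.0 | 1.0 | 1.0 | 1.0 | 4.0 | 0.0 | 11.0 | 11.0 | 12.0 | 11.333333
270 | 677 | Mousinho da Silveira | Feminino | 19 | Rural | GT3 | Separados | Fundamental1 | Fundamental1 | Do Lar | Do Lar | Curso | Sim | Não | Outro | 2.0 | 2.0 | 3.0 | Não | Sim | Não | Sim | Não | Sim | 3.0 | 5.0 | 4.0 | 1.0 | 4.0 | 1.0 | 0.0 | 8.0 | 0.0 | 0.0 | 2.666667
271 | 678 | Mousinho da Silveira | Feminino | 18 | Rural | GT3 | Juntos | Fundamental2 | Fundamental2 | Servidor | Outras | Distancia | Sim | Sim | Mãe | 2.0 | 3.0 | 0.0 | Não | Não | Não | Não | Sim | Sim | 4.0 | 2.0 | 1.0 | 1.0 | 1.0 | 4.0 | 5.0 | 14.0 | 14.0 | 15.0 | 14.333333
272 | 679 | Mousinho da Silveira | Feminino | 18 | Rural | LE3 | Separados | Fundamental1 | Fundamental2 | Do Lar | Outras | Curso | Sim | Não | Mãe | 3.0 | 2.0 | 0.0 | Não | Não | Não | Não | Sim | Sim | 4.0 | 3.0 | 4.0 | 1.0 | 4.0 | 5.0 | 0.0 | 16.0 | 15.0 | 15.0 | 15.333333
273 | 680 | Mousinho da Silveira | Feminino | 19 | Rural | GT3 | Juntos | Fundamental1 | Fundamental1 | Do Lar | Outras | Curso | Sim | Sim | Outro | 2.0 | 2.0 | 1.0 | Não | Sim | Não | Não | Sim | Sim | 4.0 | 3.0 | 3.0 | 1.0 | 1.0 | 3.0 | 4.0 | 7.0 | 8.0 | 9.0 | 8.000000
274 | 681 | Mousinho da Silveira | Feminino | 17 | Urbano | GT3 | Juntos | Superior | EMedio | Professor | Outras | Outros | Sim | Sim | Mãe | 2.0 | 2.0 | 0.0 | Não | Não | Não | Não | Sim | Não | 5.0 | 5.0 | 4.0 | 1.0 | 1.0 | 1.0 | 0.0 | 6.0 | 9.0 | 11.0 | 8.666667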